Language Accent Detection with CNN Using Sparse Data from a Crowd-Sourced Speech Archive

نویسندگان

چکیده

The problem of accent recognition has received a lot attention with the development Automatic Speech Recognition (ASR) systems. crux is that conventional acoustic language models adapted to fit standard corpora are unable satisfy requirements for accented speech. In this research, we contribute task group up nine European accents in English and try provide some evidence favor specific hyperparameter choices neural network together search best input speech signal parameters ameliorate baseline accuracy. Specifically, used CNN-based model trained on audio features extracted from Accent Archive dataset, which crowd-sourced collection recordings. We show harnessing time–frequency energy (such as spectrogram, chromogram, spectral centroid, rolloff, fundamental frequency) Mel-frequency cepstral coefficients (MFCC) may increase accuracy classification compared feature sets MFCC and/or raw spectrograms. Our experiments demonstrate most impact brought about by amplitude mel-spectrograms linear scale fed into model. Amplitude scale, correlates energy, allow produce state-of-the-art results brings Germanic, Romance Slavic ranged 0.964 0.987; thus, outperforming existing classifying use Archive. also investigated how rhythm affects Based our preliminary experiments, recordings their original form (i.e., all pauses preserved) other experiments.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Natural Language Query Refinement for Problem Resolution from Crowd-Sourced Semi-Structured Data

We study the problem of natural language query generation for decision support systems (DSS) in the problem resolution domain. In this domain, a user has a task he is unable to accomplish (eg. bluetooth headphone not playing music), which we capture using language structures. We show how important units that define a problem can robustly and automatically be extracted from large noisy online fo...

متن کامل

Using Qualitative Spatial Logic for Validating Crowd-Sourced Geospatial Data

We describe a tool, MatchMaps, that generates sameAs and partOf matches between spatial objects (such as shops, shopping centres, etc.) in crowd-sourced and authoritative geospatial datasets. MatchMaps uses reasoning in qualitative spatial logic, description logic and truth maintenance techniques, to produce a consistent set of matches. We report the results of an initial evaluation of MatchMap...

متن کامل

Crowd - sourced

While many Learning Content Management Systems are available, the collaborative, community-based creation of rich e-learning content is still not sufficiently well supported. Few attempts have been made to apply crowd-sourcing and wiki-approaches for the creation of elearning content. In this article, we showcase SlideWiki -an Open Courseware Authoring platform supporting the crowdsourced creat...

متن کامل

Crowd Sourced Data for the Social Sciences

Tools deployed by the social scientist are both qualitative and quantitative: analysis is based on primary data such as surveys, questionnaires and interviews, or on secondary data associated with statistics collected and collated by primary agencies (ESRC, 2009). In the last two years, there has been a distinct sea change in our ability to collect and mine both qualitative and quantitative dat...

متن کامل

Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive

English is the only language available for global communication and is used by 1.5 billions of speakers. It is also known to have a large diversity of pronunciation due to the influence of speakers’ mother tongue, called accents. Our project aims at creating a global and speaker-basis map of English accents to be used in learning World Englishes as well as research studies of World Englishes [1...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10162913